Blip Image Captioning Base Bf16
MIT
This model is a reduced-precision version of Salesforce/blip-image-captioning-base. Its weights are stored in bfloat16 instead of float32, which cuts weight memory usage by about 50%. It is suitable for image-to-text generation tasks such as image captioning.
Image-to-Text
Transformers
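
The 50% figure in the description follows from the storage width: a float32 value takes 4 bytes and a bfloat16 value takes 2. The sketch below is an illustration only, not the conversion procedure used for this model. It shows the arithmetic and a pure-Python bfloat16 round trip. The helper names and the parameter count are hypothetical, and the rounding helper handles finite values only.

```python
import struct

FP32_BYTES = 4  # bytes per float32 weight
BF16_BYTES = 2  # bytes per bfloat16 weight


def weight_memory_bytes(num_params: int, bytes_per_param: int) -> int:
    """Memory needed to store the weights alone (no activations or buffers)."""
    return num_params * bytes_per_param


def float_to_bf16_bits(x: float) -> int:
    """Return the 16-bit bfloat16 pattern for a finite float.

    bfloat16 keeps the sign bit, the 8 exponent bits, and the top 7 mantissa
    bits of float32. Rounding here is round-to-nearest-even; NaN and infinity
    are not handled specially.
    """
    (bits32,) = struct.unpack("<I", struct.pack("<f", x))
    rounding_bias = 0x7FFF + ((bits32 >> 16) & 1)
    return ((bits32 + rounding_bias) >> 16) & 0xFFFF


def bf16_bits_to_float(bits16: int) -> float:
    """Expand a bfloat16 bit pattern back to a Python float."""
    (x,) = struct.unpack("<f", struct.pack("<I", bits16 << 16))
    return x


if __name__ == "__main__":
    num_params = 1_000_000  # hypothetical count, not this model's real size
    fp32 = weight_memory_bytes(num_params, FP32_BYTES)
    bf16 = weight_memory_bytes(num_params, BF16_BYTES)
    print(f"float32: {fp32} bytes, bfloat16: {bf16} bytes, ratio: {bf16 / fp32}")

    value = 3.14159
    restored = bf16_bits_to_float(float_to_bf16_bits(value))
    print(f"{value} stored as bfloat16 reads back as {restored}")
```

The round trip shows the trade-off: bfloat16 keeps the float32 exponent range but only about 8 bits of precision, so values read back slightly changed.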